Prior parameter transformation for unsupervised speaker adaptation
نویسندگان
چکیده
In a strictly Bayesian approach, prior parameters are assumed known, based on common or subjective knowledge. But a practical solution for maximum a posteriori adaptation methods is to adopt an empirical Bayesian approach, where the prior parameters are estimated directly from training speech data itself. So there is a problem of mismatches between training and testing conditions in the use of prior parameters. We proposed a prior parameter transformation (PPT) adaptation approach that transforms the prior parameters to be more representative of the new speaker. In this paper we extend it to unsupervised mode. For easily confused speech units, different transformation matrices are applied to make them distinct. Initial experiments show that the PPT algorithm can get much improvement for a small amount of adaptation data even in the unsupervised mode.
منابع مشابه
Extraction of reliable transformation parameters for unsupervised speaker adaptation
Adaptation of speaker-independent hidden Markov models (HMM’s) to a new speaker using speaker-specific data is an effective approach to reinforce speech recognition performance for the enrolled speaker. Practically, it is desirable to flexibly perform the adaptation without any knowledge or limitation on the enrolled adaptation data (e.g. data transcription, length and content). However, the in...
متن کاملExtraction of Reliable Transformation Parameters for Unsupervised Speaer Adaptation
Adaptation of speaker-independent hidden Markov models (HMM’s) to a new speaker using speaker-specific data is an effective approach to reinforce speech recognition performance for the enrolled speaker. Practically, it is desirable to flexibly perform the adaptation without any knowledge or limitation on the enrolled adaptation data (e.g. data transcription, length and content). However, the in...
متن کاملRegression transformation of prior means for speaker adaptation
Maximum a posteriori adaptation method combines the prior knowledge with adaptation data from a new speaker, which has a nice asymptotical property, but has a slow adaptation rate for not modifying unseen models. In a strictly Bayesian approach, prior parameters are assumed known, based on common or subjective knowledge. But a practical solution is to adopt an empirical Bayesian approach, where...
متن کاملOn-line Bayesian speaker adaptation using tree-structured transformation and robust priors
This paper presents new results by using our recently proposed on-line Bayesian learning approach for affine transformation parameter estimation in speaker adaptation. The on-line Bayesian learning technique allows updating parameter estimates after each utterance and i t can accommodate flexible forms of transformation functions as well as prior probability density function. We show through ex...
متن کاملIterative unsupervised adaptation using maximum likelihood linear regression
Maximum likelihood linear regression (MLLR) is a parameter transformation technique for both speaker and environment adaptation. In this paper the iterative use of MLLR is investigated in the context of large vocabulary speaker independent transcription of both noise free and noisy data. It is shown that iterative application of MLLR can be beneficial especially in situations of severe mismatch...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000